Finding New Information Via Robust Entity Detection
نویسندگان
چکیده
Journalists and editors work under pressure to collect relevant details and background information about specific events. They spend a significant amount of time sifting through documents and finding new information such as facts, opinions or stakeholders (i.e. people, places and organizations that have a stake in the news). Spotting them is a tedious and cognitively intense process. One task, essential to this process, is to find and keep track of stakeholders. This task is taxing cognitively and in terms of memory. Tell Me More offers an automatic aid to this task. Tell Me More is a system that, given a seed story, mines the web for similar stories reported by different sources and selects only those stories which offer new information with respect to that original seed story. Much like a journalist, the task of detecting named entities is central to its success. In this paper we briefly describe Tell Me More and, in particular, we focus on Tell Me More’s entity detection component. We describe an approach that combines off-the-shelf named entity recognizers (NERs) with WPED, a publicly available NER that uses Wikipedia as its knowledge base. We show significant increase in precision scores with respect to traditional NERs. Lastly, we present an overall evaluation of Tell Me More using this approach.
منابع مشابه
A Method for Body Fat Composition Analysis in Abdominal Magnetic Resonance Images Via Self-Organizing Map Neural Network
Introduction: The present study aimed to suggest an unsupervised method for the segmentation of visceral adipose tissue (VAT) and subcutaneous adipose tissue (SAT) in axial magnetic resonance (MR) images of the abdomen. Materials and Methods: A self-organizing map (SOM) neural network was designed to segment the adipose tissue from other tissues in the MR images. The segmentation of SAT and VA...
متن کاملSetExpan: Corpus-Based Set Expansion via Context Feature Selection and Rank Ensemble
Corpus-based set expansion (i.e., finding the “complete” set of entities belonging to the same semantic class, based on a given corpus and a tiny set of seeds) is a critical task in knowledge discovery. It may facilitate numerous downstream applications, such as information extraction, taxonomy induction, question answering, and web search. To discover new entities in an expanded set, previous ...
متن کاملA Simple and Efficient Skew Detection Algorithm via Text Row Algorithm
document image processing, skew detection An important part of any document recognition system is detection of skew in the image of a page. This paper presents a new, accurate and robust skew detection algorithm based on a method for finding rows of text in page images. Results of a test of the new algorithm and a comparison against Baird's well known algorithm on 400 pages show the new algorit...
متن کاملNamed Entity Recognition in Persian Text using Deep Learning
Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...
متن کاملDesign of robust controller by neuro-fuzzy system in a prescribed region via state feedback
In this paper, first a new algorithm for pole assignment of closed-loop multi-variable controllable systems in a prescribed region of the z-plane is presented. Then, robust state feedback controllers are designed by implementing a neural fuzzy system for the placement of closed-loop poles of a controllable system in a prescribed region in the left-hand side of z-plane. A new method based on the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010